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Abstract 

We survey several results known on sampling in computational geometry. 

1 Introduction 

In this write-up, we are interested in how much information can be extracted by random 
sampling of a certain size for a range space of VC dimension d. In particular, we show that 
several standard results about samplings follow from the sampling theorem of Lin et al. 



LLS01 



The following assumes that the reader is familiar and comfortable with e-nets and e- 
approximations. The results surveyed in this write-up are summarized in Figure [D 

2 Preliminaries 

Lin et al. [LLSOlj consider more general functions, but in the settings we are interested 
in, their result can be described as follows. We are given a range space S = (X, 5F) of VC 
dimension d, where X is a point set, and J is a family of subsets of X. In out settings, we will 
usually consider a finite subset ICX and we will be interested in the range space induced 
by S on X. In particular, let iV be a sample of X. For a range ReJ, let 

,^ \RnX\ , , n . \Rf]N\ 

r = r(R)= — | and s = s(R) = — . 

Intuitively, r is the total weight of R in X, while s is the sample estimate for r. For a 
parameter v > 0, consider the distance function between real numbers 



r — s 
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Name 


Property VR G j 


Sample size 


e-net |HW87| 
Theorem l3.1l 


r (R) > e => s(R) > 




^-approximation VC71 
Theorem 13.21 


|r(R) -s(R)| < e 


°® 


Sensitive ^-approximation 
|Bro95 IBCM99j 
Theorem 142] 


|r(R)-s(R)| < -^/r(R) + e) 




Relative (e, ^-approximation 

|OKMS06| 

Theorem 15.21 


r(R) < p s(R) < (l + e)p 
r(R) > p =4> 

(l-e)r(R) < s(R) < (l + e)r(R) 


o(4V) 

\e 2 p pj 



Figure 1: Here, R denotes a range in the given range space, r(R) is the fractional weight of 
R, and s(R) is its fractional weight in the random sample. The samples have the required 
property (for all the ranges in the range space) with constant probability. 
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Theorem 2.1 ([L LSOl] ) Let a, v, 5 > be parameters, and let S = (X, 5F) be a range space 
with VC dimension d. Let X C X be a finite set. We have, that a random sample (with 
repetition) of size 

1 / .. 1 . r 



O — dlog- + log 

\a z v \ v 

from X has the property that 

VRg3 d v (r(R),s(R)) <a. 

And this holds with probability > 1 — 5. 

It is hard in the sea of parameters to see the trees, so let us play with the parameters a 

bit. 



3 Getting the £-net and ^-approximation theorems 

Theorem 3.1 ([H W87j . e-Net Theorem) A sample of size 0((d/e) log(l/e)) from X , is 
an e-net of S = (X, 5F), where X C X, and this holds with constant probability. 

Proof: Let a = 1/4, v = e, 5 = 1/4, and apply Theorem 12. 1[ The sample size is 

°(^( dlo ^ + log j)) =0 (? log ?)- 

Now, let R G 3" be a range such that |R n X\ > en, where n = \X\, we have that 

d v (r{R),s(R)) < a = 1/4 
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(with constant probably for all ranges). Namely, 

|r(R) -s(R)\ 



r(R) + s(R)+e 



< 1/4. 



The bad case for us, here is that r(R) > e, but s(R) = 0. But then, the above inequality 
becomes 

2 2s-r(R) + e 

which is, of course, false. Thus, it must be that s(R) > 0, which implies that N is indeed an 
£-net. ■ 

Theorem 3.2 ( |VC71| . e- Approximation Theorem.) A sample of size 

(J 5 (d + .ogI 

from X , is an e- approximation of S = (X, 5F), where X C X, and i/w's ftoWs w^/i probability 
> 1-5. 

Proof: Set a = e/4 and z/ = 1/4. We have, by Theorem 12.11 that for any R 6 5F, it holds 

\r — s\ < — (r + s + z/) < e, 
implying the claim. ■ 

4 Sensitive ^-approximation 

Another similar concept was introduced by |BCM 99j. 

Definition 4.1 A sample N C X is sensitive e -approximation if 

VRe^ |r(R) -s(R)| < Kv 7 ^) +e) • 

Observe that a set iV which is sensitive ^-approximation is, simultaneously, both an e 2 -net 
and an e-approximation. 

The following theorem shows the existence of sensitive ^-approximation. Note that the 
bound on its size is (slightly) better than the bound shown by |Bro95l |BCM99| . 

Theorem 4.2 A sample N from X of size 

(I(dlogi + logi 

is a sensitive e- approximation, with probability > 1 — 5. 
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Proof: Let v { = ie 2 /800, = ^/T/4~i } for % = 1,...,M = [800/e 2 ]. As such, for 
i = 1, . . . , M, we have afui = e 2 /1600. Consider a single random sample N of size 

tf = o(i(dlogi + logf))= (I(dlogi + logi 

It is a sample complying with Theorem 12.11 with parameters Ui and aii, with probability at 
least 1 — S/M, since 

of * (dlog± + log^ =o(Vdlog- + log^n > | =0(Z7). 



Q<i^i V $ J J \ £2 \ £ $ 

Namely, Theorem 12.11 holds for N, with probability at least S, for parameters oti and z/j, for 
alH = 1,...,M. 

Next, consider a range Re J, such that r = r(R) G [(z — l)e 2 /800, ie 2 /800] and s = s(R). 
We assume for the sake of simplicity of exposition that i > 1, as this case can be handled 
similarly to the more general case. This implies that z/,/2 < r < z^, and as such 



We have that 



, i \ \r - s\ 

d Ui {r, S) <OLi ; ; < «j 

r + s + fi 



If s < Vj, we have that 

\r — s 



E\/r _ £ 



which implies that N is indeed sensitive ^-approximation. Otherwise, if s > V{ > r, then we 
have 

s — r < «j(r + s + =>> (1 — (Xi)(s — r) — c^r < a!j(r + z/j) 

=^ (I - on){s - r) < on{2r + Ui) 

aAlr + uA , . 

(1 - Oi) 

since oti < 1/2. As such, by Eq. (pp), we have 



eJr eJr 

\s — r < oa^i < 6 < , 

1 1 ~ 20 ~ 2 ' 



which implies the claim. 



Looking on the bounds of sensitive ^-approximation as compared to ^-approximation, its 
natural to ask whether its size can be improved, but observe that since such a sample is also 
an e 2 -net, and it is known that fl(d/e 2 log(l/e)) is a lower bound on the size of such a net 



KPW92| . this implies that such improvement is impossible. 
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5 Relative ^-approximation 

Definition 5.1 A subset iV C X is a relative (p,e)- approximation if for each R 6 J, we 
have: 

(i) If r(R) > p then 

(1 - e)r(R) < s(R) < (1 + e)r(R). 

(ii) If r(R) < p then s(R) < (1 + e)p. 

The concept was introduced by |CKMS06| . except that property (ii) was not required. 
However, property (ii) is just an easy (but useful) "monotonicity" property that holds for all 
the constructions I am aware of. 

There are relative approximations of size (roughly) l/(e 2 p). As such, relative approxi- 
mation is interesting, in the case where p « e. Then, we can approximate ranges of weight 
larger than p with a sample that has only linear dependency on l/p. Otherwise, we would 
have to use the regular p-approximations, and there the required sample is of size (roughly) 
\/p\ 

Theorem 5.2 A sample N of size 0\ ^— f d log — h log — J J is a relative (p, e) -approximation 

\e 2 p\ p 5JJ 

with probability > 1 — 5. 

Proof: Set v = p/2, and a = e/9, and apply Theorem 12 . 1L We get that, for any range 
Re J, such that r = r(R) > p and s = s(R), it holds 

\r — s\ £ 

dJr,s) = <a =^ \r — s\ < -(r + s + p/2). 

r + s + v 9 

If s < r then, since r > p, we have that 

£ £ 

\r — s\ < —{r + s + p/2) < — 3r < er, 

which implies property (i). Otherwise, s > r, and then 

s-r < -(r + s + p/2) (1 - e/9)s < (1 + e/9)r + p/2. 
9 

This implies that 

l + e/9 1 4 9 

1 — e/9 2(1 — 5/9) 3 16 
Thus, s < 3r, which implies that 

£ £ 

\r — s\ < —(r + s + p/2) < -5r < er, 
I 9 Fl ' ~ 9 ~ 

which again implies property (i). 

As for property (ii), if r < p then we need to show that s < (1 + s)p, and this follows 
easily from the above calculations. ■ 

In fact, one can slightly strengthen the concept by making it "sensitive". 
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Theorem 5.3 A sample N of size 0[ ( d log — h log — J J is a relative (ip, el \fi)-approximat 

Wp\ V oJJ 
with probability > 1 — 5, for all i > 0. 

Namely, for any range ReJ, smc/i t^at r(R) > ip, u>e /zawe 

l-^r(R)< S (R)<(l + ^r(R). (3) 



Proof: Set p« = ip and Ej = e/y/i, for z = 1, . . . , Now, apply Theorem 15.21 and observe 
that all the samples are of the same size, and as such one can use the same sample to get 
this guarantee for all i. ■ 

Interestingly, sensitive approximation imply relative approximations. 

Lemma 5.4 Let s,p > be parameters, and let e' = Syfp. Then, if N is sensitive e' - 
approximation to the set system (X, 3^) then its also a relative (e,p)- approximation. 

Proof: We know that VR G 3 it holds |r(R) - s(R)| < ^(y/rjR) + /) . As such, for 
R G 3, if r(R) = ap and a > 1, then we have 

|r(R) - s(R)| < £ -^{^P + ey/p) = ^ + |v^P < + |) op < er(R), 
which implies that A r is a relative (5, ^-approximation. ■ 
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